Adaptive schemes for incomplete quantum process tomography 
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We propose an iterative algorithm for incomplete quantum process tomography, with the help of 
quantum state estimation, based on the combined principles of maximum-likelihood and maximum- 
entropy. The algorithm yields a unique estimator for an unknown quantum process when one has 
less than a complete set of linearly independent measurement data to specify the quantum process 
uniquely. We apply this iterative algorithm adaptively in various situations and so optimize the 
amount of resources required to estimate the quantum process with incomplete data. 
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I. INTRODUCTION 

Quantum process tomography (QPT) is an important 
tool to characterize the operation of a given quantum 
channel [H-Q. Such a characterization is needed, for ex- 
ample, when one attempts to construct a quantum chan- 
nel comprising multiple logic gates, each carrying out a 
specific quantum process. One such quantum channel for 
entanglement distillation, for instance, would consist of 
CNOT gates. A physical quantum process is described by 
a completely-positive map M.. That is, given a particu- 
lar input quantum state p\ residing in the Dj-dimensional 
Hilbcrt space H, the resulting output state p Q in the Do- 
dimensional Hilbcrt space K, is given by 



Po 



M( P = J2 K mPi K L 



(1) 



with the Kraus operators K m satisfying the relation 
Y^m^m^m = Ijc- The K m s are not unique and any 
other set of Kraus operators 



i 



(2) 



where the u m i m s are the elements of a unitary matrix, 
also parameterizes the completely-positive map M d[. 

The idea behind QPT is to estimate such completely- 
positive maps with measurements. Much like quantum 
state tomography, the estimation of an unknown quan- 
tum process can be perceived as the estimation of a posi- 
tive Choi-Jamiolkowski operator E tmc that is represented 
by a DiD Q x DiD matrix Q. Such an operator contains 
all accessible information about the quantum process. 
The standard QPT procedure involves the measurement 
of multiple copies of L different output states, with each 
output state corresponding to one of the L linearly in- 
dependent input states pf\ thereby using a probability 
operator measurement (POM) of, say, M outcomes. The 
unknown operator Etrue is estimated by linear-inversion 
of the LM measurement frequencies, which consists of 



DfD% linearly independent constraints. Like the linear- 
inversion procedure for quantum state estimation, the 
resulting estimator obtained may not be positive. If that 
is the case, the estimator cannot be used for statistical 
predictions. This failure occurs whenever the observed 
relative frequencies of the measurement outcomes do not 
have consistent interpretation as probabilities. What 
is, therefore, called for, is an estimation procedure that 
ensures a physically meaningful estimator whatever the 
measurement data may be. 

One statistically meaningful technique to obtain a pos- 
itive estimator for -Etrue is the maximum-likelihood esti- 
mation procedure (ML) Q. This can be applied to yield 
a unique estimator Eml as long as the measurement data 
obtained form a set of DfD^ linearly independent con- 
straints. Wc say that this set of measurement data is 
informationally complete. However, the number of lin- 
early independent parameters increases rapidly with the 
dimensions and a complete characterization of E truc be- 
comes unfeasible for complex processes. This is espe- 
cially true when the quantum process acts on an infinite- 
dimensional Hilbert space . The well-known method of 
Direct Characterization of Quantum Dynamics (DCQD) 
[3] was introduced to reduce the amount of measurement 
resources that are used for quantum process tomography. 
However, this method requires entangled input states and 
post-processing strategies that can be expensive when 
dealing with more complex quantum processes. 

A more straightforward and conceptually different ap- 
proach is to resort to informationally incomplete QPT. 
With this approach, less measurement resources are used 
to obtain an estimator for the unknown quantum process 
to a fair amount of accuracy. As a consequence, there ex- 
ists a convex set of infinitely many ML estimators which 
are consistent with the measurement data. To choose 
the estimator which is least-biased from the convex set, 
we invoke the maximum-entropy principle Q and choose 
the estimator with the largest entropy. Such an incom- 
plete QPT can also give useful information about the 
quantum channel. In a typical tomography experiment, 
with data from measuring a finite number of copies, the 
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resulting quantum process estimator can never be ex- 
actly equal to -Etrue since experimental fluctuations are 
inevitable. One can only obtain an estimator that is close 
to .©true within a certain tomographic precision. Thus, 
MLME QPT is typically useful in providing a unique es- 
timator for an unknown quantum process within a suit- 
able tomographic precision using fewer incomplete mea- 
surement resources. As will be shown, this reduction in 
measurement resources is more pronounced for unitary 
quantum channels. Since E tIUC is unknown, one common 
practice is to gauge such a tomographic precision with 
another operator -E pr i r that is close to -Etrue> based on 
some prior information one has about the constructed 
quantum channel. The availability of such a -E pr ior for a 
given -Etrue will become useful and important in subse- 
quent discussions. 

The estimators obtained using the aforementioned 
method are least-biased with respect to the set of in- 
complete measurement data in the sense of the entropy 
of the quantum process. In Ref . fioj , which is an analyti- 
cal study of the conventional maximum-entropy method, 
the entropy functional for the Choi-Jamiolkowski opera- 
tor E describing a quantum channel was introduced as 
S{E) = -tr{(E/Di)log(E/D{)} and this was shown to 
exhibit nice properties. In particular, this concave chan- 
nel entropy functional has a unique maximum in E and 
is zero only when the quantum channel is unitary since 
E/D{ is then a rank-1 projector. However, the analyt- 
ical results in apply only to simple qubit channels 
and are difficult to extend to general quantum channels 
of greater complexity. 

In this article, we shall extend the strategy in Ref. 
and establish an adaptive iterative algorithm to search 
for the MLME estimator -Emlme which maximizes both 
the likelihood and entropy functionals using the channel 
entropy functional in [lOj. We first give some prelim- 
inary ideas on quantum process estimation in Sec. [TXJ 
Then, in Sec. IIII1 we will present the iterative MLME al- 
gorithm using variational principles to derive a steepest- 
ascent scheme and apply it to numerical simulations of 
two-qubit and three-qubit quantum channels. In Scc. lIVl 
we will establish adaptive strategies to apply the MLME 
algorithm with the aim of minimizing the amount of mea- 
surement resources needed to perform incomplete QPT. 



II. PRELIMINARIES OF QUANTUM PROCESS 
ESTIMATION 

The estimation of the completely-positive map M. that 
describes an unknown quantum process, in the man- 
ner presented in Eq. (JTJ) , is isomorphic to the estima- 
tion of an unknown quantum state. This is a con- 
sequence of the well-known Choi-Jamiolkowski isomor- 
phism [HQ. Let us define a maximally-entangled pure 
state \ 1 $>+) = J2j \j)-H ® \j)w /v^Di in terms of the com- 
putational basis kets Here, the dimensions of 
the Hilbcrt spaces H and %' are both equal to the dimen- 



sion Di of the input Hilbert space. Using this basis, there 
exists a one-to-one correspondence between the map M. 
and a unique positive operator E defined as follows: 

E =D\ {1 H <& £«') (1*+) (*+!) (3) 
= J2(\j)(k\)®M(\ 3 )(k\) , (4) 

with lu being the identity map. From Eq. ([T]), the alter- 
native expression 

E = J2\i>m)&m\, (5) 
m 

with 

\^ m ) = {lH®K m )\^+)^/D i , (6) 

implies that the rank of E is equal to the number of 
linearly independent K m s. It follows that E is rank-1 
if the completely-positive map is described by a single 
unitary Kraus operator, and only then. 

The output state can be expressed in terms of E by 
means of 

Po = tY H {E(pJ ®l K )} , (7) 

where the transposition is defined with respect to the 
computational basis. Hence, reconstructing the quantum 
process amounts to estimating the positive operator E. 
To do so, one requires a total of DfD% real parameters 
to specify the corresponding matrix. In the subsequent 
analyses, we shall consider trace-preserving maps, that is 
tr{pi} = tr{p } for any pi, in which case the number of 
independent parameters is reduced to Df(D% — 1), with 
the constraints compactly written as 

tr K {E} = l H . (8) 

To estimate E, typically a set of L input states p> , 
with N copies each, are sent through the quantum chan- 
nel, one state at a time. The output state p2 that cor- 
responds to Pi is measured with a POM consisting of 
M outcomes II m > such that J2 m ^ m = The 
probability of getting outcome IL m for the input state 

p { p is given by p hn = tr{# fpf )T <8 II m ) j j L. Here, 

p[ = ^2 m pim = 1/L. If the LM parameters comprise 
DfD% linearly independent ones, the measurement data 
will be informationally complete. One can thus per- 
form a complete quantum process estimation using the 
maximum-likelihood (ML) algorithm Q and so obtain a 
unique positive estimator Eml by maximizing the likeli- 
hood functional 

C(E)^f[(f[ P ^) , (9) 

(=1 \rn=l / 

where the number of occurrences ni m for the outcome 
Ii rn obtained in an experiment with the input state pf^ 
are such that n\ = J2 m n im = N. 
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In what follows, we shall discuss the situation in which 
one performs quantum process estimation with a set of 
informationally incomplete measurement data. All pro- 
cesses considered henceforth will be trace-preserving, but 
a generalization to those which are not trace-preserving 
is straightforward. 



III. THE ITERATIVE ALGORITHM 

We consider the optimization of the information func- 
tional 



I(\;E) = XS(E) 



1 

LN 



log £(£?), 



(10) 



where A is a parameter which scales the entropy relative 
to the normalized log-likelihood and should be chosen 
with a very small value. When the measurement data 
are informationally complete, one sets A to zero and op- 
timizing I(X = 0; E) amounts to the ML problem Q. In 
the same spirit as in fill ], both our knowledge from the 
measurement data (contained in log (£(£*)) /LN which 
measures the information gain) and our ignorance (re- 
flected in S(E) which measures the lack of information) 
about the operator E arc taken into account in such a way 
that our ignorance takes an infinitesimal weight. This in- 
troduces a small and smooth convex hill over the set of 
positive ML estimators which selects the one with the 
largest entropy. As in [ll| , the value of A may be chosen 
such that both log (C(E)) /LN and S(E) remain almost 
constant with respect to A. 

To maximize J(A; E) with respect to E, wc define the 
variation E + 8E = (1 + Z^)E(1 + Z), where Z is a small 
arbitrary operator such that Eq. © is satisfied, that is: 
tr{8E} = 0. Thus the most general expression for Z is 



{(1 + SAt) E (1 + 8A)} ® Ik 



1+Z = (1 + 8A) 

(11) 

with an unrestricted infinitesimal 6 A. On the other 
hand, the variation of /(A; E) with respect to E gives 
tr{bEW}, where 



yr r 1 \ - fim (/) T tt A 

W = T2^—Pi ®n m - — 

Im 



(12) 



and fi m = ni m /LN. Keeping only first-order variations 
systematically and imposing 6/(A; E) > 0, the method of 
steepest ascent leads us to 



8A = 6j4 t 



W 



1 



tr K {WE + EW} <g> 1 K (13) 



for some small e > 0. Hence, to obtain the MLME esti- 
mator -Emlme j one simply fixes A <c 1 and iterates the 



equations 

E n+1 =(l + Zl)E n (l + Z n ), 

(6A) n = | \W n - itr K {W n E n + E n W n ] ® 1 K 

' (14) 

where the expression for Z n follows from Eq. (fTTj) and 
W n denotes the operator W in Eq. (fT2"j) evaluated for 
E n . One may do so by starting from a randomly chosen 
operator Eq and continue until the extremal equation 
for -Emlme is satisfied with some pre-chosen numerical 
precision. To derive this extremal equation, wc define 
the Lagrange functional Q 

V(E) = /(A; E) — tr{A E} (15) 

with the Lagrange operator A = h<S)lic for the constraints 
in Eq. ([8]), where h is a Hcrmitian operator. Setting the 
variation of D(E) to zero gives the extremal equation 



AE 



MLME 



A — Wmlme-^mlmeWmlme 



(16) 



1 



with A — ^tv K |Wmlme-EmlmeWmlme| 

Thus far, we have been assuming that the measure- 
ment outcomes II m give perfect detection of quantum 
systems. The iterative equations in Eq. (fT4|) can be gen- 
eralized to the case of imperfect detection. In this case 
it is clear that, if each of the M measurement outcomes 
II m is assigned a detection efficiency r\ m < 1, one can 
always define a new set of M measurement outcomes 
fi m = f?mll m such that G = J2 m ^n ^ l K- The it- 
eration procedure of Eq. (fbfj) can still be used with the 
new set of POM outcomes II m provided that the operator 
W in Eq. (fT2"j) is replaced by W — Wo, where 



W n = 



1 



rE 



Pi 



(OT 



G 



(17) 



accounts for the copies that escape detection. 

As an example, we apply the algorithm to numerical 
simulations on two-qubit channels, the CNOT gate de- 
scribed by the unitary operator 



f^CNOT 



/l ON 
10 
1 

V0 1 0/ 



(18) 



and a randomly generated non-unitary quantum chan- 
nel described by a full-rank Choi-Jamiolkowski matrix, 
as well as the three-qubit Toffoli gate described by the 
unitary operator 

/l 0\ 
1 
10 
10 
10 
10 
1 
Vo 1 0/ 



bffoli : 



(19) 
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To quantify the discrepancy between an MLME estima- 
tor and the true Choi-Jamiolkowski operator E tIue , we 
use the trace-class distance 



V, 



'MLME 



— tr( 



E 



MLME 



Etn 



|}, (20) 



a 0.6 - 



- CNOT gate 

- Random quantum channel 



where |^4| = v At A for any operator A. In these simu- 
lations, we take the ^-dimensional projectors of a sym- 
metrically informationally complete POM (SIC POM) as 
the input states [T2[. One such set of states contains the 
minimal number of Df pure states \ such that 



p 0.2 - 



A + l 



(21) 



As shown in Fig. [IJ using the MLME algorithm for QPT 
can give fast convergence in terms of tomographic effi- 
ciency with a reduced number of input states as quan- 
tum resources. This reduction is especially significant 
for unitary processes, where the Choi-Jamiolkowski op- 
erators are rank-1. For nonunitary quantum processes 
described by matrices of larger rank, the tomographic ef- 
ficiency will be lower as shown in the first plot of Fig. [Tj 
This is expected in analogy with quantum state tomogra- 
phy where it is more difficult to reliably estimate highly- 
mixed states than nearly-pure ones. 



IV. ADAPTIVE STRATEGIES 

An interesting question to ask with regard to incom- 
plete QPT is whether one can perform it in an optimal 
way given the available resources by means of adaptive 
strategies. Here optimality refers to the minimization 
of the amount of resources (input states or measure- 
ments) used to perform incomplete QPT such that the 
distance between -Emlme and -Etruc reaches a certain de- 
sired value. Very frequently, despite the fact that -Etruc 
is always unknown, one has a rough idea of an operator 
Eprior which may be close to Etruc based on some prior 
information about the unknown Etruc- This scenario is 
reasonable and typical when one designs a quantum chan- 
nel experimentally which performs an expected quantum 
operation, with errors arising from imperfections of the 
components that make up the channel. We shall establish 
adaptive strategies which make use of such an operator 
in order to select, with the help of the MLME algorithm, 
resources for incomplete QPT in an optimal way. We re- 
fer to such tomography schemes as the adaptive MLME 
quantum process tomography (AMLME QPT) schemes. 

We will focus on adaptive strategies to choose the input 
states optimally. This can be reviewed in two separate 
cases: The case in which a fixed set of linearly indepen- 
dent input states is used (Sec. IIV A[) and that in which 
arbitrary input states can be generated for incomplete 
QPT fScc- IIVB] . The optimization of the POM will be 
studied on a later occasion. 
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FIG. 1. Numerical simulations on the two-qubit (d = 2 2 ) and 
three-qubit (d — 2 3 ) quantum channels where D\ — D = d. 
The projectors of symmetric informationally complete POMs 
(SIC POMs) are chosen as the linearly independent input 
states for all the simulations (L = d 2 ). For the measure- 
ments, informationally complete POMs consisting of tensor 
products of qubit SIC POMs are used (M = d ). Each qubit 
SIC POM consists of a set of pure states whose Bloch vectors 
form the "legs of a tetrahedron" in the Bloch sphere. For 
the two-qubit channels, N = 10 4 and an average over 50 ex- 
periments is taken to compute the trace-class distances. For 
the three-qubit channel, the measurement data are generated 
without statistical noise. For unitary channels, one can see 
that the MLME algorithm can still give fairly accurate es- 
timations with a smaller number of input states than that 
of a linearly independent set. Numerical simulations of arbi- 
trary two-qubit and three-qubit unitary channels suggest that 
the number is approximately d 2 /2 for SIC POM input states, 
above which there is insignificant tomographic improvement. 



A. Optimization over a fixed set of linearly 
independent input states 

In the previous section, we considered the projectors of 
the SIC POMs, which are known to have optimal tomo- 
graphic efficiencies, as input states in the numerical sim- 
ulations. Since these POMs are symmetric in the sense 
of Eq. (|2ip . any ordering of the input states in a given set 
gives the same plots in Fig. [TJ In practice, however, such 
entangled states are difficult to produce and one typically 
has access to a set of separable states [l3| for measure- 
ments instead. In this case, there no longer exists such a 
symmetry and the tomographic performance depends on 
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the order of the input states chosen, possibly strongly so. 
We propose to optimize the tomographic performance by 
choosing the input states adaptively based on the mea- 
surement data collected from the previously chosen input 
states, thereby using the prior E pllor . 

To describe the adaptive strategy, let us consider a set 
of L > Df input states in which Df of them are linearly 
independent. Suppose that N, which is a fixed integer for 
all input states, copies of a randomly chosen input state 
Pj ' are sent through the quantum channel and the first 
set of measurement data {i>h, . . . , vim}, Yl m Vlm = ' s 
collected. With these data f\ m = i/ im , one obtains the 



first MLME estimator E 



(i) 

MLME' 



To select the next input 



state out of the remaining L — 1 states, we take ©prior as 
a gauge for E truc to generate L — 1 sets of probabilities re- 
spectively from the L—l states. Each set of probabilities 

• ' "2M }> 



is then treated as the set of frequencies {z^i 
for the corresponding input state k. Hence, one has L — l 
sets of measurement data, each set being the combined 
data {^n, . . . , vim, i • • • j ^2m}/^ w hh the normalized 



frequencies f lm = u lm /2 and p 2m 
flm + f2m) 



^2m/2 such that 

J2 m (hm + fim) = 1 f° r cac h an d the corresponding 
L — l projected MLME estimators ©mlme k . 

The value of k is selected such that a chosen figure 

* (2) 

of merit which quantifies the distance between -E^lme k 

and ^mlme * s the largest, so that there is a high 
chance for the next MLME estimator to be closer to 



Etrv 



As an example, the figure of merit is taken 



to be the trace-class distance 23 tr ^mlme fc> ^mlme 

With this input state, the second estimator © MLME ^ s 
then obtained with MLME QPT. One repeats this pro- 
cedure for subsequent input states until the distance 

'mlmei-^mlme) i s below some preset threshold. 
An alternative to this would be to minimize the trace- 
class distance T> tI (E^l E k , E pi i OI 

It is important to understand that in this strategy, the 
prior information ©prior is not used to reconstruct the 
unknown quantum process in any way. It serves only 
as a means to optimally select the input states from 
the given set so as to maximize the tomographic con- 
vergence. This adaptive strategy also relies partially on 
the measurement data obtained in the experiment. We 
have thus introduced an operational method of using the 
prior information to minimize the amount of resources 
needed to perform reliable MLME QPT without intro- 
ducing any artifacts coming from the prior information 
into the reconstruction procedure. To summarize, the 
adaptive MLME strategy is as follows: 

1. Randomly choose p[ from the set of L input states 
and set 1 = 1. 

(a) Perform QPT using p^ and obtain the set of 
frequencies {v ll} , . . , v lM }, J2 m v lm = 1- 

(b) Set v = \J j=1 {fji,. ■ ■ ,VjM} fl- 



ic) Invoke the MLME algorithm with v and ob- 



tain E 



MLME' 



Use -Eprior to compute the fre- 



quencies {v\^ xy . . , v\l\ M }, J2 m v i+i m = !. 
from the remaining input states, with k label- 
ing the remaining L — I states. 

(d) Define L — I sets of accumulated frequen- 



cies [v U { 



(fc) 
i+i l ' 



M+im))I( 1 + l ) and cal " 
culatc the L — I projected MLME estimators 



E 



MLMEl ■ 



(Z+l) 

k such that V tr ( E 



(e) Set p\ ~ ri> as the input state corresponding to 

' Ml) 



E v 

MLME.fc ' MLME 



is largest. 



2. Set 1 = 1 + 1 and repeat Steps l(a)-l(e). 



B. Optimization over the Hilbert space 

More generally, the adaptive strategy may be extended 
to the case in which one has access to the entire Hilbert 
space of states. In other words, the task is to search 

for the next optimal input state p\ L+1 ^ from the D r 
dimensional Hilbert space based on the measurement 
data vi m obtained in the experiment from L previously 
chosen input states, where v\ m = 1 for all I, and 
the prior information ©prior about the unknown quan- 
tum process. 

To this end, we define the normalized projected log- 
likelihood functional 



log £(©, p) = ^2 J^fl lo S (P«m) + J~T[ log ^ 



(22) 



where 



Pi i 



tr< E ■ 



EL 



L+l 



= tr{E pl - ior p T <g> n m } and p m = tr< E 



P T ®H r , 

L + l 



with I always running from 1 to L over all previously 
used input state labels. This projected log-likelihood 
functional is a good approximation to the log-likelihood 
functional for the situation in which the state p is cho- 
sen as the next input state for the experiment as long 
as -©prior is not too far away from ©true- The projected 
frequencies v m estimate the actual frequencies one gets 
when measuring the input state p. An optimal input 
state p[ L and the corresponding Choi-Jamiolkowski 
operator are chosen as the positive estimators that max- 
imize this projected log-likelihood functional. 

Coincidcntally, this maximum projected log-likelihood 
(MPL) procedure is equivalent to minimizing the cross 
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entropy functional C(E,p) = — log£(E, p) [la] over 
all positive operators subjected to the respective con- 
straints for p and E. Hence, another way of understand- 
ing this procedure is to first regard both the incomplete 
measurement data collected after using L input states 
and -Eprior as the full prior knowledge one has about the 
unknown -Btrue- The statistical motivation for MPL or 
minimizing C(E, p) is, loosely speaking, to obtain estima- 
tors which are as compatible with this prior knowledge 
as possible by minimizing the entropy of the prior knowl- 
edge C(E, p). We will provide some more arguments re- 
lated to this optimization technique in the later part of 
this section. 

To carry out the optimization, we consider the re- 
sponse of log£(£',/9) to small variations of both p and 
E. After some similar calculations as in Sec. IIII1 we ob- 
tain the MPL iterative equations 



E n+1 = (1 + Zl)E n {\ + Z n ) , 

(1 + e2S n )/9 n (l + e 2 E n ) 



Pn+l 



tr w {(l + e 2 s n )p„(l + e 2 5„)} 
where Z n is defined by Eq. (| 1 1 1) with 

(6A)„ = ^ ( X n - hv K {X n E n + E n X n } <g> l K 



(23) 



(l)T „ tt 



v m p T <g) il 



Pm (L+iy- 



, (24) 



and 



H„ = Y n - tv-H{Y n p n } 
Y„ = tr K - 



{ 1"^ i H ®n r 

I ^ L + l 

\ \_ m 



X log (p m )E p i' 



-E 



(L + l)p. 

The MPL estimators satisfy the extremal equations 
A-E MPL A = X MPL E MPL X MPL , 

PMPL^MPL = i^MPLpMPL = tr-H{>MPLPMPL} PMPL 



(25) 



where 



A 



f7{ 



Xmvi.E 



MPL-C'MPlATmpL 



1k 



(26) 



(27) 



The small parameters t\ and e 2 are positive numbers. 
Thus, to carry out the MPL procedure, one iterates 
Eqs. (|23|) until Eqs. (|26|) arc satisfied with a preset nu- 
merical precision. 

There is one important feature of this optimization 
scheme. From Eq. (|22|) . we note that log C(E,p) is nei- 
ther convex nor concave in p and hence can have multi- 
ple local maxima. Thus, the MPL optimization is non- 
convex. To generate these local-maxima estimators, one 



can start from multiple randomly chosen starting points 
and perform the iterations. Thereafter, the state estima- 
tor (Smpl to be chosen as the next input state p[ L+1 ^ is 
such that its corresponding -Empl gives the largest trace- 
class distance away from the previous MLME estimator 
^mlme: which is obtained from the data of the previ- 
ously chosen L input states, over all generated pairs of 
MPL estimators (/Smpl, ^mpl)- Again, one may also 
minimize the distance between -Empl and -B pr ior- Let us 
summarize the adaptive MPL-MLME strategy with the 
following scheme: 

1. Randomly choose /jj as the first input state and 
set I = 1. 

(a) Perform QPT using p\ and obtain the set of 
frequencies {va, . . . , vim}, J2 m v lm = 1- 

(b) Set v = U' = iW, • ■ • , v jM }/ 1. 

(c) Invoke the MLME algorithm with v and Ob- 
tain pW 

Uctlll -^ MLME - 

(d) Using i? P rior, generate a set of pairs of MPL 
estimators (pmplj-Bmpl), where the states 
Pmpl were not part of the I input states pre- 
viously used, by iterating Eqs. (|26|) from dif- 
ferent, randomly chosen starting points. 

(e) Set /Jj ^ as the input state correspond- 



ing to the state estimator pmpl such that 

(^MPL^mlm, 



ME J is the largest 



2. Set 1 = 1 + 1 and repeat Steps l(a)-l(e). 

With this, let us first compare the performances of 
the three proposed schemes, namely the non-adaptive 
MLME scheme in Sec. [TTT1 the adaptive MLME scheme 
in Sec. HVAl and the adaptive MPL-MLME scheme. For 
this purpose, we consider two quantum processes, the 
first being an imperfect CNOT gate whose action is de- 
scribed by the Kraus operators 



K i = Vl - eC/cNOT and K 2 = ■ 



(28) 



This first channel is a CNOT gate with probability 1 — e 
and docs nothing to the input states with probability e, 
an imperfect CNOT gate represented by a rank-2 Choi- 
Jamiolkowski operator. The second process is described 
by the Kraus operators 



A'i = VT~ eC/cNOT and {Kj = \feBj} 



J=2 ' 



(29) 



where the 15 operators Bj are randomly generated and 
satisfy the equation J^j-BjBj = 1^. This second chan- 
nel, which is represented by a full-rank matrix, is a 
CNOT gate with probability 1 — e and randomly per- 
turbs the input states with probability e due to additional 
noise. As an example, we set e = 0.1. Figure [2] shows the 
numerical results. 



7 



Imperfect CNOT 



f, 0.2 



2 4 6 8 

Number of input states 



- Non-adaptive MLME 

- Adaptive MLME 

- Adaptive MPL-MLME 



10 12 14 16 



0.8 <V, 



g 0.6 - 




Imperfect CNOT with random noise 



—A— Non-adaptive MLME 
, -B- Adaptive MLME 

Adaptive MPL-MLME 



4 6 8 10 12 14 16 

Number of input states 



FIG. 2. A comparison of three incomplete QPT schemes: the 
non-adaptive MLME scheme, the adaptive MLME scheme 
and the adaptive MPL-MLME scheme. Monte Carlo sim- 
ulations are carried out on two different types of imperfect 
CNOT gates described in the text. Here, N = 10 and an av- 
erage over 50 experiments is taken to compute the trace-class 
distances. For both the non-adaptive as well as the adaptive 
MLME schemes, the 16 linearly independent input states are 
chosen to be tensor products of projectors of the kets |0), 
|X>, (|0) + |1))/V2 and (|0) + |l)i)/V2 0. For all schemes, 
the POM outcomes are chosen to be the tensor products of 
qubit SIC POMs. The tomographic performance of the adap- 
tive MPL-MLME scheme is the best out of the three. The 
plots show that the tomographic efficiency can be further im- 
proved by optimizing the input states over the Hilbert space 
instead of restricting to a fixed set of linearly independent in- 
put states, albeit the small difference in tomographic perfor- 
mance between the two adaptive schemes for some quantum 
processes. 



Next, to understand how this adaptive MPL-MLME 
strategy can lead to an optimization in tomographic per- 
formance, we need to know how increasing the number 
of input states used in AMLME QPT can affect the cor- 
responding MLME estimators. Since we are considering 
only a subset of the full linearly independent input states 
in general, there exists a convex set of estimators -Eml 
maximizing the likelihood functional for a given set of in- 
formationally incomplete measurement data. This means 
that the likelihood functional possesses a plateau hover- 
ing over this convex set of estimators. As the number of 
input states L used increases, the likelihood plateau will 
either remain unchanged (if no additional information 
about £truc is gained after performing QPT with new 



input states) or decrease in size (if new independent in- 
formation is obtained) . Thus in general, the plateau will 
continue to shrink to a point when a full set of linearly 
independent input states is used. 

We conjecture that the adaptive MPL-MLME strat- 
egy optimizes the rate of decrease in the size of the like- 
lihood plateau by maximizing the normalized projected 
log-likclihood functional with respect to the next input 
state. A point of view to justify this conjecture is to 
interpret the maximum of the- normalized log-likelihood 
functional log(£(-E)) /LN as the maximum information 
gain from the measurement data. When the number of 
copies N is infinite, the data are noiseless and the re- 
sulting maximum information gain is J2i m flm log(/im), 
which is the negative of the Shannon entropy of the mea- 
surement data. For finite N, the maximum information 
gain over the space of density operators will typically 
be lower than the true maximum due to the positivity 
constraint, especially when -Etrue is highly rank-deficient. 
In this language, the MPL-MLME strategy attempts to 
maximize this maximum information gain as much as 
possible via the optimization of future input states over 
the entire Hilbert space of density operators, using the 
normalized projected log-likelihood functional as an esti- 
mate for the actual normalized log-likelihood functional 
describing future measurements. This is a possible expla- 
nation for the optimal decrease in the likelihood plateau 
size since one has maximal knowledge about the unknown 
Etmo gained with the optimized input states and so the 
ambiguity in the estimators is minimized. 

We illustrate this point by considering the imperfect 
CNOT gate with e = 0.1 described by Eq. ®. Since 
the boundary of the likelihood plateau is complicated, 
we shall estimate its size numerically by first generating 
No = 10 3 ML estimators £^ L labeled with the index j 
for a given set of measurement data. Next, in the same 
spirit as in numerical sampling, we can define the opera- 
tor centroid 



E 



ML 



, N 

— V pO) 
No 2 ^ ML 

3=1 



(30) 



for this generated set of estimators and the normalized 
Hilbcrt-Schmidt standard deviation 



A 



1 



2Nn 



(31) 



away from the centroid. Thus, < A < 1. For suf- 
ficiently large No, the size of the plateau may be well 
approximated by the spread A. Figure[3]compares the re- 
spective performances of the the three proposed schemes 
by analyzing the size of the likelihood plateau and the 
maximum of the normalized log-likelihood functional. 
From Fig. [31 it is crucial to understand that A does not, 
strictly speaking, decrease monotonically with increas- 
ing height of the normalized log-likclihood functional. A 
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Number of input states 



FIG. 3. The dependence of the size of the likelihood plateau 
(A) and the normalized log-likelihood maximum on the num- 
ber of input states. The respective performances of the non- 
adaptive MLME scheme, the adaptive MLME scheme and 
the adaptive MPL-MLME scheme are computed based on 
noiseless measurement data for an imperfect CNOT gate with 
e = 0.1. For both the non-adaptive MLME scheme and the 
adaptive MLME scheme, the 16 linearly independent input 
states are chosen to be tensor products of projectors of the 
kets |0), |1), (|0) + |1»/V2 and (|0) + 1) i)/V2. For all 
schemes, the POM outcomes are chosen to be the tensor prod- 
ucts of qubit SIC POMs. From the plot, the rate of decrease 
of A is the greatest with the adaptive MPL-MLME scheme. 
The increase in the normalized log-likelihood maxima with 
the adaptive MPL-MLME scheme may also be interpreted as 
greater maximum information gain after measurements using 
the optimal input states as compared to the other schemes. 



counterexample is shown in the figure, that is a signif- 
icant decrease in A for the adaptive MLME scheme as 
compared to the non-adaptive one with the correspond- 
ing slight decrease in its normalized log-likelihood max- 
ima. We emphasize that what the adaptive MPL-MLME 
strategy exploits is the possible trend of this behavior. 

To end this part of the section, we comment that the 
aforementioned idea can be applied to adaptively choose 
the next set of POM outcomes H, based on the collected 
measurement data. However, to perform the optimiza- 
tion successfully requires the solutions to more techni- 
cal problems which include ensuring that the POM out- 
comes are linearly independent after the optimization. 
This project is left for future studies. 



C. A combination of both adaptive strategies 

Let us begin this final part of the section by review- 
ing the non-convex feature of the MPL-MLME strategy 
discussed in Sec. IIVB1 The presence of multiple local- 
maxima estimators which are linearly independent is an 
important element of the MPL-MLME strategy as it pro- 
vides linearly independent input states which are optimal 
for measurement based on the data obtained from the 
experiments. In general, because of the non-linearity of 
Eq. (|26|) . it is difficult to determine the number of such 
linearly independent extremal solutions for a given set of 
measurement data by analytical means. One can only 
search for as many linearly independent local-maxima 
estimators /Smpl as possible via numerical optimizations 
from different starting points within a reasonable time 
period. 
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Number of input states 



FIG. 4. A comparison of three incomplete QPT schemes: 
the non-adaptive MLME scheme, the adaptive MLME scheme 
and a combination of the adaptive MPL-MLME scheme and 
the adaptive MLME scheme (hybrid scheme). Monte Carlo 
simulations are carried out on the imperfect CNOT gate with 
e — 0.1. Here, N — 10 4 and an average over 50 experiments is 
taken to compute the trace-class distances. For both the non- 
adaptive as well as the adaptive MLME schemes, the default 
set of 16 linearly independent input states are chosen to be 
tensor products of projectors of the kets 1 0} , 1 1) , ( 1 0} + 1 1} ) /\/2 
and (|0) + |1) i)/\/2. For all schemes, a set of 16 randomly 
generated positive operators, which are all linearly indepen- 
dent of one another, are used to form the POM. For this POM, 
the average repetition frequency of the adaptive MPL-MLME 
scheme is very high after four input states are used. The first 
input state for all schemes is chosen to be the same separable 
state p' 1 ' = 1 00) (00|. For the third scheme, the second to 
the fourth input states (shaded region) are optimized using 
the adaptive MPL-MLME strategy and the subsequent input 
states are chosen via the adaptive MLME strategy using the 
default set of input states which excludes 1 00) (00|. The plot 
shows that the overall performance of the combined strategy 
is better than the adaptive MLME strategy alone. 

Another technical subtlety is that these local-maxima 
estimators tend to repeat themselves during the opti- 
mization. Hence, a local-maxima estimator which was 
chosen as one of the input states earlier may reappear in 
later optimizations. The repetition frequency strongly 
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depends on the POM chosen to measure the output 
states. The examples given thus far make use of the 
product tetrahedron measurements as the POM and the 
resulting MPL optimizations give linearly independent 
estimators with few repetitions. This may not be the case 
for other types of POM. In view of this, another way of 
doing AMLME QPT is to use both adaptive strategies in 
Sees. HVAl and HVBl interchangeably, the hybrid MLME 
strategy. For example, one can start with the adap- 
tive MPL-MLME strategy for tomography and when the 
repetition rate increases as more input states have been 
used, one may switch to the first adaptive MLME strat- 
egy. Figure [4] suggests that such a hybrid MLME strat- 
egy can further improve the tomographic performance as 
compared with the adaptive MLME strategy alone. 

V. SUMMARY 

We have established adaptive numerical strategies to 
perform incomplete quantum process tomography. One 
may choose whichever strategy that is convenient to her 
to carry out tomography depending on the available types 
of measurement resources at hand. Each of these strate- 
gies combines the simplicity of incomplete quantum pro- 
cess tomography using quantum state estimation with 
good tomographic performances using optimization tech- 
niques. It can never be overemphasized that, although 
some prior information is necessary for each adaptive 
strategy, such information is never used in the estima- 



tion of the unknown quantum process. Rather, the prior 
information is utilized to adaptivcly select future mea- 
surement resources, the input states in our context, based 
on the current measurement data, to optimize the tomo- 
graphic performance. The discussions presented in this 
article, therefore, provide a means of obtaining estima- 
tors for the unknown quantum process using incomplete 
resources which are typically within reasonably good ex- 
perimental precisions. These estimators are statistically 
meaningful in that they are least-biased with respect to a 
set of informationally incomplete measurement data and 
are hence suitable for partial characterization of quan- 
tum processes. This is in contrast with the standard 
quantum process tomography which generally requires a 
huge amount of informationally complete measurement 
resources. 
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